Biblioteca Digital

34 resultados para Genetic Association Studies

em DigitalCommons@The Texas Medical Center

Concept, design and implementation of a cardiovascular gene-centric 50 k SNP array for large-scale genomic association studies.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

A wealth of genetic associations for cardiovascular and metabolic phenotypes in humans has been accumulating over the last decade, in particular a large number of loci derived from recent genome wide association studies (GWAS). True complex disease-associated loci often exert modest effects, so their delineation currently requires integration of diverse phenotypic data from large studies to ensure robust meta-analyses. We have designed a gene-centric 50 K single nucleotide polymorphism (SNP) array to assess potentially relevant loci across a range of cardiovascular, metabolic and inflammatory syndromes. The array utilizes a "cosmopolitan" tagging approach to capture the genetic diversity across approximately 2,000 loci in populations represented in the HapMap and SeattleSNPs projects. The array content is informed by GWAS of vascular and inflammatory disease, expression quantitative trait loci implicated in atherosclerosis, pathway based approaches and comprehensive literature searching. The custom flexibility of the array platform facilitated interrogation of loci at differing stringencies, according to a gene prioritization strategy that allows saturation of high priority loci with a greater density of markers than the existing GWAS tools, particularly in African HapMap samples. We also demonstrate that the IBC array can be used to complement GWAS, increasing coverage in high priority CVD-related loci across all major HapMap populations. DNA from over 200,000 extensively phenotyped individuals will be genotyped with this array with a significant portion of the generated data being released into the academic domain facilitating in silico replication attempts, analyses of rare variants and cross-cohort meta-analyses in diverse populations. These datasets will also facilitate more robust secondary analyses, such as explorations with alternative genetic models, epistasis and gene-environment interactions.

Functional data analysis approaches for genotype-phenotype association studies from next-generation sequencing

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Next-generation DNA sequencing platforms can effectively detect the entire spectrum of genomic variation and is emerging to be a major tool for systematic exploration of the universe of variants and interactions in the entire genome. However, the data produced by next-generation sequencing technologies will suffer from three basic problems: sequence errors, assembly errors, and missing data. Current statistical methods for genetic analysis are well suited for detecting the association of common variants, but are less suitable to rare variants. This raises great challenge for sequence-based genetic studies of complex diseases.^ This research dissertation utilized genome continuum model as a general principle, and stochastic calculus and functional data analysis as tools for developing novel and powerful statistical methods for next generation of association studies of both qualitative and quantitative traits in the context of sequencing data, which finally lead to shifting the paradigm of association analysis from the current locus-by-locus analysis to collectively analyzing genome regions.^ In this project, the functional principal component (FPC) methods coupled with high-dimensional data reduction techniques will be used to develop novel and powerful methods for testing the associations of the entire spectrum of genetic variation within a segment of genome or a gene regardless of whether the variants are common or rare.^ The classical quantitative genetics suffer from high type I error rates and low power for rare variants. To overcome these limitations for resequencing data, this project used functional linear models with scalar response to develop statistics for identifying quantitative trait loci (QTLs) for both common and rare variants. To illustrate their applications, the functional linear models were applied to five quantitative traits in Framingham heart studies. ^ This project proposed a novel concept of gene-gene co-association in which a gene or a genomic region is taken as a unit of association analysis and used stochastic calculus to develop a unified framework for testing the association of multiple genes or genomic regions for both common and rare alleles. The proposed methods were applied to gene-gene co-association analysis of psoriasis in two independent GWAS datasets which led to discovery of networks significantly associated with psoriasis.^

Association of retinoic acid receptor genes with meningomyelocele.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Neural tube defects (NTDs) occur in as many as 0.5-2 per 1000 live births in the United States. One of the most common and severe neural tube defects is meningomyelocele (MM) resulting from failed closure of the caudal end of the neural tube. MM has been induced by retinoic acid teratogenicity in rodent models. We hypothesized that genetic variants influencing retinoic acid (RA) induction via retinoic acid receptors (RARs) may be associated with risk for MM. METHODS: We analyzed 47 single nucleotide polymorphisms (SNPs) that span across the three retinoic acid receptor genes using the SNPlex genotyping platform. Our cohort consisted of 610 MM families. RESULTS: One variant in the RARA gene (rs12051734), three variants in the RARB gene (rs6799734, rs12630816, rs17016462), and a single variant in the RARG gene (rs3741434) were found to be statistically significant at p < 0.05. CONCLUSION: RAR genes were associated with risk for MM. For all associated SNPs, the rare allele conferred a protective effect for MM susceptibility.

Association of folate receptor (FOLR1, FOLR2, FOLR3) and reduced folate carrier (SLC19A1) genes with meningomyelocele.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Meningomyelocele (MM) results from lack of closure of the neural tube during embryologic development. Periconceptional folic acid supplementation is a modifier of MM risk in humans, leading toan interest in the folate transport genes as potential candidates for association to MM. METHODS: This study used the SNPlex Genotyping (ABI, Foster City, CA) platform to genotype 20 single polymorphic variants across the folate receptor genes (FOLR1, FOLR2, FOLR3) and the folate carrier gene (SLC19A1) to assess their association to MM. The study population included 329 trio and 281 duo families. Only cases with MM were included. Genetic association was assessed using the transmission disequilibrium test in PLINK. RESULTS: A variant in the FOLR2 gene (rs13908), three linked variants in the FOLR3 gene (rs7925545, rs7926875, rs7926987), and two variants in the SLC19A1 gene (rs1888530 and rs3788200) were statistically significant for association to MM in our population. CONCLUSION: This study involved the analyses of selected single nucleotide polymorphisms across the folate receptor genes and the folate carrier gene in a large population sample. It provided evidence that the rare alleles of specific single nucleotide polymorphisms within these genes appear to be statistically significant for association to MM in the patient population that was tested.

Single nucleotide polymorphism (SNP) selection for genotype-phenotype association studies

Relevância:

100.00% 100.00%

Publicador:

Resumo:

With hundreds of single nucleotide polymorphisms (SNPs) in a candidate gene and millions of SNPs across the genome, selecting an informative subset of SNPs to maximize the ability to detect genotype-phenotype association is of great interest and importance. In addition, with a large number of SNPs, analytic methods are needed that allow investigators to control the false positive rate resulting from large numbers of SNP genotype-phenotype analyses. This dissertation uses simulated data to explore methods for selecting SNPs for genotype-phenotype association studies. I examined the pattern of linkage disequilibrium (LD) across a candidate gene region and used this pattern to aid in localizing a disease-influencing mutation. The results indicate that the r2 measure of linkage disequilibrium is preferred over the common D′ measure for use in genotype-phenotype association studies. Using step-wise linear regression, the best predictor of the quantitative trait was not usually the single functional mutation. Rather it was a SNP that was in high linkage disequilibrium with the functional mutation. Next, I compared three strategies for selecting SNPs for application to phenotype association studies: based on measures of linkage disequilibrium, based on a measure of haplotype diversity, and random selection. The results demonstrate that SNPs selected based on maximum haplotype diversity are more informative and yield higher power than randomly selected SNPs or SNPs selected based on low pair-wise LD. The data also indicate that for genes with small contribution to the phenotype, it is more prudent for investigators to increase their sample size than to continuously increase the number of SNPs in order to improve statistical power. When typing large numbers of SNPs, researchers are faced with the challenge of utilizing an appropriate statistical method that controls the type I error rate while maintaining adequate power. We show that an empirical genotype based multi-locus global test that uses permutation testing to investigate the null distribution of the maximum test statistic maintains a desired overall type I error rate while not overly sacrificing statistical power. The results also show that when the penetrance model is simple the multi-locus global test does as well or better than the haplotype analysis. However, for more complex models, haplotype analyses offer advantages. The results of this dissertation will be of utility to human geneticists designing large-scale multi-locus genotype-phenotype association studies. ^

The information bottleneck method for genome-wide association studies

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In population studies, most current methods focus on identifying one outcome-related SNP at a time by testing for differences of genotype frequencies between disease and healthy groups or among different population groups. However, testing a great number of SNPs simultaneously has a problem of multiple testing and will give false-positive results. Although, this problem can be effectively dealt with through several approaches such as Bonferroni correction, permutation testing and false discovery rates, patterns of the joint effects by several genes, each with weak effect, might not be able to be determined. With the availability of high-throughput genotyping technology, searching for multiple scattered SNPs over the whole genome and modeling their joint effect on the target variable has become possible. Exhaustive search of all SNP subsets is computationally infeasible for millions of SNPs in a genome-wide study. Several effective feature selection methods combined with classification functions have been proposed to search for an optimal SNP subset among big data sets where the number of feature SNPs far exceeds the number of observations. ^ In this study, we take two steps to achieve the goal. First we selected 1000 SNPs through an effective filter method and then we performed a feature selection wrapped around a classifier to identify an optimal SNP subset for predicting disease. And also we developed a novel classification method-sequential information bottleneck method wrapped inside different search algorithms to identify an optimal subset of SNPs for classifying the outcome variable. This new method was compared with the classical linear discriminant analysis in terms of classification performance. Finally, we performed chi-square test to look at the relationship between each SNP and disease from another point of view. ^ In general, our results show that filtering features using harmononic mean of sensitivity and specificity(HMSS) through linear discriminant analysis (LDA) is better than using LDA training accuracy or mutual information in our study. Our results also demonstrate that exhaustive search of a small subset with one SNP, two SNPs or 3 SNP subset based on best 100 composite 2-SNPs can find an optimal subset and further inclusion of more SNPs through heuristic algorithm doesn't always increase the performance of SNP subsets. Although sequential forward floating selection can be applied to prevent from the nesting effect of forward selection, it does not always out-perform the latter due to overfitting from observing more complex subset states. ^ Our results also indicate that HMSS as a criterion to evaluate the classification ability of a function can be used in imbalanced data without modifying the original dataset as against classification accuracy. Our four studies suggest that Sequential Information Bottleneck(sIB), a new unsupervised technique, can be adopted to predict the outcome and its ability to detect the target status is superior to the traditional LDA in the study. ^ From our results we can see that the best test probability-HMSS for predicting CVD, stroke,CAD and psoriasis through sIB is 0.59406, 0.641815, 0.645315 and 0.678658, respectively. In terms of group prediction accuracy, the highest test accuracy of sIB for diagnosing a normal status among controls can reach 0.708999, 0.863216, 0.639918 and 0.850275 respectively in the four studies if the test accuracy among cases is required to be not less than 0.4. On the other hand, the highest test accuracy of sIB for diagnosing a disease among cases can reach 0.748644, 0.789916, 0.705701 and 0.749436 respectively in the four studies if the test accuracy among controls is required to be at least 0.4. ^ A further genome-wide association study through Chi square test shows that there are no significant SNPs detected at the cut-off level 9.09451E-08 in the Framingham heart study of CVD. Study results in WTCCC can only detect two significant SNPs that are associated with CAD. In the genome-wide study of psoriasis most of top 20 SNP markers with impressive classification accuracy are also significantly associated with the disease through chi-square test at the cut-off value 1.11E-07. ^ Although our classification methods can achieve high accuracy in the study, complete descriptions of those classification results(95% confidence interval or statistical test of differences) require more cost-effective methods or efficient computing system, both of which can't be accomplished currently in our genome-wide study. We should also note that the purpose of this study is to identify subsets of SNPs with high prediction ability and those SNPs with good discriminant power are not necessary to be causal markers for the disease.^

Genetic Predictors of Clinical Outcomes in Non-Small Cell Lung Cancer Patients

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Lung cancer is the leading cause of cancer-related mortality in the US. Emerging evidence has shown that host genetic factors can interact with environmental exposures to influence patient susceptibility to the diseases as well as clinical outcomes, such as survival and recurrence. We aimed to identify genetic prognostic markers for non-small cell lung cancer (NSCLC), a major (85%) subtype of lung cancer, and also in other subgroups. With the fast evolution of genotyping technology, genetic association studies have went through candidate gene approach, to pathway-based approach, to the genome wide association study (GWAS). Even in the era of GWAS, pathway-based approach has its own advantages on studying cancer clinical outcomes: it is cost-effective, requiring a smaller sample size than GWAS easier to identify a validation population and explore gene-gene interactions. In the current study, we adopted pathway-based approach focusing on two critical pathways - miRNA and inflammation pathways. MicroRNAs (miRNA) post-transcriptionally regulate around 30% of human genes. Polymorphisms within miRNA processing pathways and binding sites may influence patients’ prognosis through altered gene regulation. Inflammation plays an important role in cancer initiation and progression, and also has shown to impact patients’ clinical outcomes. We first evaluated 240 single nucleotide polymorphisms (SNPs) in miRNA biogenesis genes and predicted binding sites in NSCLC patients to determine associations with clinical outcomes in early-stage (stage I and II) and late-stage (stage III and IV) lung cancer patients, respectively. First, in 535 early-stage patients, after correcting multiple comparisons, FZD4:rs713065 (hazard ratio [HR]:0.46, 95% confidence interval [CI]:0.32-0.65) showed a significant inverse association with survival in early stage surgery-only patients. SP1:rs17695156 (HR:2.22, 95% CI:1.44-3.41) and DROSHA:rs6886834 (HR:6.38, 95% CI:2.49-16.31) conferred increased risk of progression in the all patients and surgery-only populations, respectively. FAS:rs2234978 was significantly associated with improved survival in all patients (HR:0.59, 95% CI:0.44-0.77) and in the surgery plus chemotherapy populations (HR:0.19, 95% CI:0.07-0.46).. Functional genomics analysis demonstrated that this variant creates a miR-651 binding site resulting in altered miRNA regulation of FAS, providing biological plausibility for the observed association. We then analyzed these associations in 598 late-stage patients. After multiple comparison corrections, no SNPs remained significant in the late stage group, while the top SNP NAT1:rs15561 (HR=1.98, 96%CI=1.32-2.94) conferred a significantly increased risk of death in the chemotherapy subgroup. To test the hypothesis that genetic variants in the inflammation-related pathways may be associated with survival in NSCLC patients, we first conducted a three-stage study. In the discovery phase, we investigated a comprehensive panel of 11,930 inflammation-related SNPs in three independent lung cancer populations. A missense SNP (rs2071554) in HLA-DOB was significantly associated with poor survival in the discovery population (HR: 1.46, 95% CI: 1.02-2.09), internal validation population (HR: 1.51, 95% CI: 1.02-2.25), and external validation (HR: 1.52, 95% CI: 1.01-2.29) population. Rs2900420 in KLRK1 was significantly associated with a reduced risk for death in the discovery (HR: 0.76, 95% CI: 0.60-0.96) and internal validation (HR: 0.77, 95% CI: 0.61-0.99) populations, and the association reached borderline significance in the external validation population (HR: 0.80, 95% CI: 0.63-1.02). We also evaluated these inflammation-related SNPs in NSCLC patients in never smokers. Lung cancer in never smokers has been increasingly recognized as distinct disease from that in ever-smokers. A two-stage study was performed using a discovery population from MD Anderson (411 patients) and a validation population from Mayo Clinic (311 patients). Three SNPs (IL17RA:rs879576, BMP8A:rs698141, and STK:rs290229) that were significantly associated with survival were validated (pCD74:rs1056400 and CD38:rs10805347) were borderline significant (p=0.08) in the Mayo Clinic population. In the combined analysis, IL17RA:rs879576 resulted in a 40% reduction in the risk for death (p=4.1 × 10-5 [p=0.61, heterogeneity test]). We also validated a survival tree created in MD Anderson population in the Mayo Clinic population. In conclusion, our results provided strong evidence that genetic variations in specific pathways that examined (miRNA and inflammation pathways) influenced clinical outcomes in NSCLC patients, and with further functional studies, the novel loci have potential to be translated into clinical use.

Characteristics of a spina bifida population including North American Caucasian and Hispanic individuals.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

BACKGROUND: Meningomyelocele (MM) is a common human birth defect. MM is a disorder of neural development caused by contributions from genes and environmental factors that result in the NTD and lead to a spectrum of physical and neurocognitive phenotypes. METHODS: A multidisciplinary approach has been taken to develop a comprehensive understanding of MM through collaborative efforts from investigators specializing in genetics, development, brain imaging, and neurocognitive outcome. Patients have been recruited from five different sites: Houston and the Texas-Mexico border area; Toronto, Canada; Los Angeles, California; and Lexington, Kentucky. Genetic risk factors for MM have been assessed by genotyping and association testing using the transmission disequilibrium test. RESULTS: A total of 509 affected child/parent trios and 309 affected child/parent duos have been enrolled to date for genetic association studies. Subsets of the patients have also been enrolled for studies assessing development, brain imaging, and neurocognitive outcomes. The study recruited two major ethnic groups, with 45.9% Hispanics of Mexican descent and 36.2% North American Caucasians of European descent. The remaining patients are African-American, South and Central American, Native American, and Asian. Studies of this group of patients have already discovered distinct corpus callosum morphology and neurocognitive deficits that associate with MM. We have identified maternal MTHFR 667T allele as a risk factor for MM. In addition, we also found that several genes for glucose transport and metabolism are potential risk factors for MM. CONCLUSIONS: The enrolled patient population provides a valuable resource for elucidating the disease characteristics and mechanisms for MM development.

Nonlinear tests for genetic studies of complex disease

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Linkage and association studies are major analytical tools to search for susceptibility genes for complex diseases. With the availability of large collection of single nucleotide polymorphisms (SNPs) and the rapid progresses for high throughput genotyping technologies, together with the ambitious goals of the International HapMap Project, genetic markers covering the whole genome will be available for genome-wide linkage and association studies. In order not to inflate the type I error rate in performing genome-wide linkage and association studies, multiple adjustment for the significant level for each independent linkage and/or association test is required, and this has led to the suggestion of genome-wide significant cut-off as low as 5 × 10 −7. Almost no linkage and/or association study can meet such a stringent threshold by the standard statistical methods. Developing new statistics with high power is urgently needed to tackle this problem. This dissertation proposes and explores a class of novel test statistics that can be used in both population-based and family-based genetic data by employing a completely new strategy, which uses nonlinear transformation of the sample means to construct test statistics for linkage and association studies. Extensive simulation studies are used to illustrate the properties of the nonlinear test statistics. Power calculations are performed using both analytical and empirical methods. Finally, real data sets are analyzed with the nonlinear test statistics. Results show that the nonlinear test statistics have correct type I error rates, and most of the studied nonlinear test statistics have higher power than the standard chi-square test. This dissertation introduces a new idea to design novel test statistics with high power and might open new ways to mapping susceptibility genes for complex diseases. ^

Identification of genetic variation influencing apolipoprotein E levels: Follow-up of genome-wide linkage scans in the GENOA study

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Apolipoprotein E (ApoE) plays a major role in the metabolism of high density and low density lipoproteins (HDL and LDL). Its common protein isoforms (E2, E3, E4) are risk factors for coronary artery disease (CAD) and explain between 16 to 23% of the inter-individual variation in plasma apoE levels. Linkage analysis has been completed for plasma apoE levels in the GENOA study (Genetic Epidemiology Network of Atherosclerosis). After stratification of the population by lipoprotein levels and body mass index (BMI) to create more homogeneity with regard to biological context for apoE levels, Hispanic families showed significant linkage on chromosome 17q for two strata (LOD=2.93 at 104 cM for a low cholesterol group, LOD=3.04 at 111 cM for a low cholesterol, high HDLC group). Replication of 17q linkage was observed for apoB and apoE levels in the unstratified Hispanic and African-American populations, and for apoE levels in African-American families. Replication of this 17q linkage in different populations and strata provides strong support for the presence of gene(s) in this region with significant roles in the determination of inter-individual variation in plasma apoE levels. Through a positional and functional candidate gene approach, ten genes were identified in the 17q linked region, and 62 polymorphisms in these genes were genotyped in the GENOA families. Association analysis was performed with FBAT, GEE, and variance-component based tests followed by conditional linkage analysis. Association studies with partial coverage of TagSNPs in the gene coding for apolipoprotein H (APOH) were performed, and significant results were found for 2 SNPs (APOH_20951 and APOH_05407) in the Hispanic low cholesterol strata accounting for 3.49% of the inter-individual variation in plasma apoE levels. Among the other candidate genes, we identified a haplotype block in the ACE1 gene that contains two major haplotypes associated with apoE levels as well as total cholesterol, apoB and LDLC levels in the unstratified Hispanic population. Identifying genes responsible for the remaining 60% of inter-individual variation in plasma apoE level, will yield new insights into the understanding of genetic interactions involved in the lipid metabolism, and a more precise understanding of the risk factors leading to CAD. ^

Systematic review of genetic risk score in coronary heart disease and other diseases

Relevância:

100.00% 100.00%

Publicador:

Resumo:

In order to better take advantage of the abundant results from large-scale genomic association studies, investigators are turning to a genetic risk score (GRS) method in order to combine the information from common modest-effect risk alleles into an efficient risk assessment statistic. The statistical properties of these GRSs are poorly understood. As a first step toward a better understanding of GRSs, a systematic analysis of recent investigations using a GRS was undertaken. GRS studies were searched in the areas of coronary heart disease (CHD), cancer, and other common diseases using bibliographic databases and by hand-searching reference lists and journals. Twenty-one independent case-control studies, cohort studies, and simulation studies (12 in CHD, 9 in other diseases) were identified. The underlying statistical assumptions of the GRS using the experience of the Framingham risk score were investigated. Improvements in the construction of a GRS guided by the concept of composite indicators are discussed. The GRS will be a promising risk assessment tool to improve prediction and diagnosis of common diseases.^

A GENETIC MARKER STUDY OF NON - INSULIN DEPENDENT DIABETICS IN THE MEXICAN-AMERICAN COMMUNITY FROM STARR COUNTY, TEXAS (PLASMA GLUCOSE, HEMOGLOBIN ANALYSIS)

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Diabetes mellitus occurs in two forms, insulin-dependent (IDDM, formerly called juvenile type) and non-insulin dependent (NIDDM, formerly called adult type). Prevalence figures from around the world for NIDDM, show that all societies and all races are affected; although uncommon in some populations (.4%), it is common (10%) or very common (40%) in others (Tables 1 and 2).^ In Mexican-Americans in particular, the prevalence rates (7-10%) are intermediate to those in Caucasians (1-2%) and Amerindians (35%). Information about the distribution of the disease and identification of high risk groups for developing glucose intolerance or its vascular manifestations by the study of genetic markers will help to clarify and solve some of the problems from the public health and the genetic point of view.^ This research was designed to examine two general areas in relation to NIDDM. The first aims to determine the prevalence of polymorphic genetic markers in two groups distinguished by the presence or absence of diabetes and to observe if there are any genetic marker-disease association (univariate analysis using two by two tables and logistic regression to study the individual and joint effects of the different variables). The second deals with the effect of genetic differences on the variation in fasting plasma glucose and percent glycosylated hemoglobin (HbAl) (analysis of Covariance for each marker, using age and sex as covariates).^ The results from the first analysis were not statistically significant at the corrected p value of 0.003 given the number of tests that were performed. From the analysis of covariance of all the markers studied, only Duffy and Phosphoglucomutase were statistically significant but poor predictors, given that the amount they explain in terms of variation in glycosylated hemoglobin is very small.^ Trying to determine the polygenic component of chronic disease is not an easy task. This study confirms the fact that a larger and random or representative sample is needed to be able to detect differences in the prevalence of a marker for association studies and in the genetic contribution to the variation in glucose and glycosylated hemoglobin. The importance that ethnic homogeneity in the groups studied and standardization in the methodology will have on the results has been stressed. ^

Network-based genome-wide association study of Crohn's disease

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Genome-wide association studies (GWAS) have rapidly become a standard method for disease gene discovery. Many recent GWAS indicate that for most disorders, only a few common variants are implicated and the associated SNPs explain only a small fraction of the genetic risk. The current study incorporated gene network information into gene-based analysis of GWAS data for Crohn's disease (CD). The purpose was to develop statistical models to boost the power of identifying disease-associated genes and gene subnetworks by maximizing the use of existing biological knowledge from multiple sources. The results revealed that Markov random field (MRF) based mixture model incorporating direct neighborhood information from a single gene network is not efficient in identifying CD-related genes based on the GWAS data. The incorporation of solely direct neighborhood information might lead to the low efficiency of these models. Alternative MRF models looking beyond direct neighboring information are necessary to be developed in the future for the purpose of this study.^

Identification of genetic variants that influence atherosclerosis in young persons

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Atherosclerosis is widely accepted as a complex genetic phenotype and is the usual cause of cardiovascular disease, the world’s leading killer. Genetic factors have been proven to be important risk contributors for atherosclerosis and much work has been done to identify promising candidates that might play a role in the development of atherosclerosis. It is well known that many independent replications are needed to unequivocally establish a valid genotype-phenotype association across different populations before the findings are extended to clinical settings and to the expensive follow-up studies designed to identify causal genetic variants. Aiming to replicate the association with atherosclerosis in the Pathobiological Determinants of Atherosclerosis in Youth (PDAY) study, we assessed the relationship of 32 atherosclerosis candidate SNPs to atherosclerosis in the PDAY cohort, consisting of AA and EA young people aged 15-34 years who died of non-medical causes. Two association studies, a whole sample study and a 1:1 matched case control study were performed by use of multiple linear regression and logistic regression analyses, respectively. For the whole sample association study, 32 SNPs among 2,650 individuals (1,369 AA and 1,281 EA) were tested for the association with six early atherosclerosis phenotypes: abdominal aorta fatty streaks, abdominal aorta raised lesions, right coronary artery fatty streaks, right coronary artery raised lesions, thoracic aorta fatty streaks, and thoracic aorta raised lesions. For the matched case-control association study, 337 case-control paired samples were included; cases were chosen with the highest total raised lesion scores from the studied population, while controls were randomly selected from individuals that had no raised lesions and matched to cases by age, gender and race. Sixteen SNPs in 13 genes were found to be significantly associated with atherosclerosis in at least one of the PDAY association studies. Among these 16 findings: eight SNPs (rs9579646, rs6053733, rs3849150, rs10499903, rs2148079, rs5073691, rs10116277, and rs17228212) successfully replicated previous results, six SNPs (rs17222814, rs10811661, rs7028570, rs7291467, rs16996148 and rs10401969) were reported as new findings exclusive to our study, the last two of the 16 SNPs, rs501120 and rs6922269, showed either intriguing or conflicting result. SNP rs17222814 in ALOX5AP and SNP rs3849150 in LRRC18 were consistently associated with atherosclerosis in both prior and the two PDAY association studies. SNP rs3849150 was also identified to be highly correlated with a non-synonymous coding SNP, rs17772611, which may damage the protein (polyphen score = 0.996), suggesting that SNP rs17772611 may be the causal functional variant.^ In conclusion, our study added more support for the association of these candidate genes with atherosclerosis. SNPs rs3849150 and rs17772611 of LRRC18, as well as SNP rs17222814 of ALOX5AP, were the most significant findings from our study, and may be ranked among the best for further study.^

Comparison of two statistical methods for rare variant association analysis

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Studies have shown that rare genetic variants have stronger effects in predisposing common diseases, and several statistical methods have been developed for association studies involving rare variants. In order to better understand how these statistical methods perform, we seek to compare two recently developed rare variant statistical methods (VT and C-alpha) on 10,000 simulated re-sequencing data sets with disease status and the corresponding 10,000 simulated null data sets. The SLC1A1 gene has been suggested to be associated with diastolic blood pressure (DBP) in previous studies. In the current study, we applied VT and C-alpha methods to the empirical re-sequencing data for the SLC1A1 gene from 300 whites and 200 blacks. We found that VT method obtains higher power and performs better than C-alpha method with the simulated data we used. The type I errors were well-controlled for both methods. In addition, both VT and C-alpha methods suggested no statistical evidence for the association between the SLC1A1 gene and DBP. Overall, our findings provided an important comparison of the two statistical methods for future reference and provided preliminary and pioneer findings on the association between the SLC1A1 gene and blood pressure.^

«
1
2
3
»